Picture for Enze Xie

Enze Xie

Light Interaction: Training-Free Inference Acceleration for Interactive Video World Models

Add code
May 29, 2026
Viaarxiv icon

Fast-dDrive: Efficient Block-Diffusion VLM for Autonomous Driving

Add code
May 25, 2026
Viaarxiv icon

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Add code
May 19, 2026
Viaarxiv icon

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Add code
May 14, 2026
Viaarxiv icon

Fast-dVLM: Efficient Block-Diffusion VLM via Direct Conversion from Autoregressive VLM

Add code
Apr 08, 2026
Viaarxiv icon

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

Add code
Apr 08, 2026
Viaarxiv icon

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Add code
Jan 12, 2026
Viaarxiv icon

Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed

Add code
Dec 16, 2025
Figure 1 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Figure 2 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Figure 3 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Figure 4 for Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Viaarxiv icon

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Add code
Oct 05, 2025
Viaarxiv icon

LongLive: Real-time Interactive Long Video Generation

Add code
Sep 26, 2025
Viaarxiv icon